SUBMISSION TO IEEE TRANSACTIONS ON INFORMATION THEORY 



1 



Capacity of Ultra Wide Band Wireless Ad Hoc 

Networks 

Rohit Negi, Member, IEEE and Arjunan Rajeswaran, Student Member, IEEE 



o ■ 
o 

< 



C/2 



> 

in 
0^ 
O 

00 
O 
in 
o 

O 



X 
J3 



Abstract — Throughput capacity is a critical parameter for the 
design and evaluation of ad-hoc wireless networks. Consider n 
identical randomly located nodes, on a unit area, forming an ad- 
hoc wireless network. Assuming a fixed per node transmission 
capability of T bits per second at a fixed range, it has been 
shown that the uniform throughput capacity per node r{n) is 

Q (^--j^==^, a decreasing function of node density n. 

However an alternate communication model may also be 
considered, with each node constrained to a maximum transmit 
power Pi) and capable of utilizing W Hz of bandwidth. Under 
the limiting case W ^ oo, such as in Ultra Wide Band (UWB) 
networks, the uniform throughput per node is O ^ (n log n) ^ 

(upper bound) and f2( — "'° ^j/^w, ) (achievable lower bound). 

^ (log n)'-"^^'/ ^ 

These bounds demonstrate that throughput increases with 
node density n, in contrast to previously published results. This 
is the result of the large bandwidth, and the assumed power and 
rate adaptation, which alleviate interference. Thus, the effect 
of physical layer properties on the capacity of ad hoc wireless 
networks is demonstrated. Further, the promise of UWB as a 
physical layer technology for ad-hoc networks is justified. 

Index Terms — Information theory, ad-hoc network, network 
capacity, ultra wide band. 



I. Introduction 

It has been shown [1] that the uniform throughput capacity 
per node of an ad hoc wireless network with n nodes de- 
creases with n as O ^--^=^=^, under certain physical layer 
assumptions. However, in this paper, it is shown that under 
an Ultra-Wide Band (UWB) [2] communication model (large 
bandwidth, Umited power), the uniform throughput capacity 
per node increases as G)(n("~^)/^), where a > 1 is the 
distance loss exponent. (The notation used is elaborated later 
in the paper.) Thus, this paper shows that, the design of and 
assumptions about, the physical layer of an ad hoc network 
can dramatically affect the network capacity. 

Wireless communication networks consist of nodes that 
communicate with each other over a wireless channel. Some 
wireless networks have a wired infrastructure of controllers, 
with nodes connected to a controller over a wireless Unk. Other 
networks, such as ad-hoc networks [3], are all wireless. Some 
salient features of ad hoc networks are speedy deployment, 
low cost and low maintenance. These features lend towards 
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applications such as sensor networks or military systems. The 
lack of centralized control is also advantageous for short-lived 
commercial networks. 

Ad-hoc network design is burdened with the issues of 
medium access (scheduling) at the link layer and relaying 
of data packets (routing) at the network layer The wireless 
medium or channel is a resource that must be shared amongst 
various nodes, a functionality achieved by Medium Access 
Control (MAC). The broadcast nature of the wireless medium 
and the decentralized nature of ad-hoc networks makes this 
scheduling problem very different from that in infrastructure 
networks. Interference mitigating techniques and distributed 
protocols have been considered in this regard [4], [5], [6]. 
Routing is the functionality of transporting data from the 
source to the destination across a sequence of links. In ad-hoc 
networks, routing faces a number of challenges, including the 
variability in topology due to the unreliable wireless link [7] 
and node mobility. These issues differ significantly from their 
counterparts in both cellular networks and wired networks 
(such as the Internet), and have been extensively studied [8], 
[9], [10]. Traditionally, rate and power adaptation, scheduling 
and routing are treated independently, as separate functions of 
the physical, link and network layers, respectively. In wireless 
ad-hoc networks, however, all nodes use the shared wireless 
medium resulting in a strong interdependency between pro- 
tocol layers. Therefore, in wireless ad-hoc networks, power 
adaptation, scheduling and routing require a joint consideration 
to optimize network performance [11]. In the general case, 
this joint formulation is a hard non-convex problem [11]. 
Thus, towards finding appropriate algorithmic solutions it is 
necessary to develop intuition based on information theoretic 
analysis. 

Recently, there has been significant interest in computing 
the capacity of ad-hoc networks [1], [12], [13], [14]. The 
assumed network model is a static ad-hoc wireless network, 
where n identical nodes on a unit area communicate over a 
wireless channel, with possible cooperation, to relay traffic. 
The assumed physical model is one where each link operates 
at a fixed data rate, utilizing a finite bandwidth and large power 
(i.e., large signal-to-noise ratio). Under these assumptions, it 
was shown in [1], that the per node throughput capacity is 
Q I , f I a decreasing function of the number of nodes n. 

\ sjn log n ) ° 

This pessimistic result, has motivated several works that have 
presented different results, under varied assumptions. Varia- 
tions on the network model, towards improved capacity results, 
have been explored. Reference [12] exploited node mobility, to 
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demonstrate that a constant (8(1)) per node throughput could 
be achieved in a mobile network at the cost of a large incurred 
delay. The issue of delay was addressed in [14], where the 
assumed mobility models were analyzed to provide bounds on 
the delay experienced by the packets. A comprehensive result 
was reported in [15], showing the optimal delay-throughput 
tradeoff for both static and mobile networks. Hybrid network 
models have been considered, where the inclusion of base 
stations (wired infrastructure support), cluster heads (wireless 
support) or a different number of source and destination nodes 
introduces an asymmetry into the network. This asymmetry is 
exploited and detailed analysis in [13] (and references therein) 
provides improved capacity bounds for these partially ad-hoc 
networks. Other variations at the network level include altered 
traffic patterns (e.g. single destination, relays). Simulation- 
based results on the capacity of small ad-hoc networks [16] 
confirm the theoretical results in [1], to a limited extent. 

Recently, physical layer characteristics and techniques have 
also been considered in the analysis of capacity. A constant 
order throughput increase, by the utilization of directional 
antennas, was demonstrated in [17]. Fading was analyzed in 
[18] and shown to cause a marginal (in order) loss in capacity. 
Sophisticated information theoretic ideas such as cooperative 
relaying and interference cancellation utilized in [19], resulted 
in bounds similar to [1], providing no gain. Thus, a direct 
application of physical layer techniques provides little gain. 

Currently deployed commercial wireless networks are built 
using either narrowband (e.g., cellular GSM) or wideband 
(e.g., 3G, 802.11) links. However, the demand for higher data 
rates at short distances has created a market for ultra-wide 
band links. UWB radios were designed for covert military 
applications, as a spread spectrum technology that used a 
large amount of bandwidth at extremely low powers, thus 
differentiating them from narrowband and wideband radios. 
However, recent changes in U.S. federal regulations have 
opened up UWB for commercial applications. Thus, there are 
currently intense research and development efforts underway, 
to design and standardize commercial UWB radios [20], [21], 
[22]. For example, the UWB based IEEE 802. 15.3. a standard 
[23] is expected to support 100 to 500 Mbps depending on 
the link distance. Further, UWB radios will be inexpensive and 
low power, making them ideal for ad hoc wireless applications. 
Commercial deployment of UWB networks is expected to 
occur in the near future [24]. 

Since UWB radios possess properties (extremely large 
bandwidths and low power) that are drastically different from 
existing commercial radios (finite bandwidth and large power), 
the question arises whether the existing results on ad hoc 
network capacity are applicable to UWB networks. Note that 
a system with infinite bandwidth does not imply an arbitrarily 
large link capacity, because of the finite power constraint (See 
Section lIVI l. Specifically, in this paper, an UWB communi- 
cation model is assumed, where each link operates over a 
relatively large bandwidth (W) and with a constraint Pq on the 
maximum power of transmission. The ambient Gaussian noise 
power spectral density is A^o and the signal power loss, with 
distance d, is l/d". Here a > 1, is the distance-loss exponent 
[28]. Nodes are assumed to remain static. This communication 
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model is analyzed to address the question of network capacity. 
Upper and lower bounds are derived, demonstrating that the 
network capacity is an increasing function of node density n, 
as where 8 stands for soft order^ , in contrast 

to [1], which shows a decreasing function for capacity. (As 
argued in Section|lIlJ this result is equally valid for low power, 
low data rate sensor networks, although they may not use 
a large bandwidth.) We demonstrated this dramatic effect of 
the (UWB) physical layer on network capacity first, in [25]. 
Subsequently, our result has been extended in [26] to tighten 
the lower and upper bounds to the same order, i.e., the uniform 
throughput capacity is 8(n("^^^/^). This was achieved by 
changing the assumed network model to one with a random 
Poisson number of nodes and applying the percolation theory 
developed in [27] to drop the logn terms in the bounds. The 
UWB network capacity result [25] has also been used as 
a benchmark to validate cross-layer optimization results for 
UWB networks, such as in [11]. 

The rest of the paper is organized as follows. In Section Hll 
the required background is reviewed. A short intuitive review 
of the relevant assumptions and proofs from [1] is presented. 
The concept of uniform convergence is discussed. Section |lll| 
presents the UWB communication model assumed, motivating 
the need for an analysis that is different from existing litera- 
ture. In Section IIVI important characteristics of the alternate 
communication model are presented and are used to show that 
the optimal MAC is Code Division Multiple Access (CDMA). 
Section |y] derives an upper bound on the uniform throughput 
capacity. This is achieved by analyzing an optimal routing 
scheme, under a relaxed power constraint. In Section I VII a 
lower bound is derived by applying some results of [I] to the 
new communication model. It is thus shown that the uniform 
throughput capacity r{n) is 8(n*^"~^'/^), in contrast to [1] 
where r(n) is 8(^4 — )■ Bandwidth scaling is addressed 

^ ^ ^ V n log n ' 

in Section IVIII as a guideline for practical implementations. 
Finally, conclusions are presented in Section IVIIII 

II. Background 

The objective of this paper is to demonstrate the effect of 
the ( UWB ) physical layer on ad hoc network capacity. This 
requires contrasting the assumptions and results of [I] with our 
results. Therefore, for the sake of completeness, this section 
reviews the assumptions and methods used in [1], that are most 
relevant for such a comparison. Readers familiar with [1] may 
skip this section. 

A. Network Model 

The assumed network model in [1], relevant to this paper, 
is one of Random Networks, where the n static nodes on a 
unit area are i.i.d. (independent and identically distributed) 
and distributed uniformly. To avoid edge effects, this unit area 
is considered to be on the surface of a sphere, S^. The n 
nodes communicate over a wireless channel, with possible 
cooperation, to relay traffic. Each of the n nodes has an 

'Here 0,f2 and © are the standard order bounds. The soft order © is the 
same as a © bound with the powers of log n neglected. 
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independent randomly chosen destination (chosen as the node 
closest to a random point, i.e. uniformly and independently 
distributed). 

B. Performance Metric 

All nodes require to send traffic at a rate of r{n) bits 
per second to their corresponding destinations. A uniform 
throughput r{n) is feasible if there exists a scheduling and 
relaying scheme by which every source-destination pair can 
communicate at a time-average of r{n) bits per second. The 
maximum feasible uniform throughput is the uniform through- 
put capacity, and is the metric of choice. The motivation for 
choosing this metric is a sense of fairness, since all nodes 
are assumed to be homogeneous in their capabilities and 
requirements. 

Since the underlying network is random, so is the capacity. 
The aim is to bound the random capacity by functions of n. 
Thus, bounds are shown that hold with w.h.p. (the abbreviation 
w.h.p. represents 'with high probability', i.e., with probability 
approaching 1 as n ^ oo). Specifically, the uniform through- 
put capacity r(n) is said to be of order 0(/(n)) if there exists 
deterministic constants fci > /cq > s.t. 

lim Prob( r{n) = kof{n) is feasible ) = 1 

n — *oo 

lim Prob( r(n) = kif{n) is feasible ) < 1. (1) 

n — *oo 

(ki ,Ki,Ci,Ci will be used to denote constants, with respect 
to n.) 

C. Communication Model: 

In [1], it was assumed that each node can transmit at a rate 
of T bits/s. The homogeneous nodes share a common range 
(equivalently, power) of transmission tr{n). This simplistic 
communication model assumes that all operating links trans- 
port data at a constant rate. The transmission by a legitimate 
transmitter Xi to its intended receiver Xj is successful, if their 
distances are related as , 



\Xk~X,\ > (1 + A) \X,-X, 



(2) 



for every other Xk transmitting simultaneously. This interfer- 
ence criterion models a protocol, which specifies a guard zone 
A, around the receiver where no other node may transmit, 
and is termed as the 'protocol model'. A second model, 
more directly related to physical layer design, works with 
the Signal to Interference Noise Ratio at the receiver, with 
the assumption of arbitrarily large power. (Note that the 
assumption of arbitrarily large power is in sharp contrast to the 
UWB model assumed in this paper.) The order results remain 
the same under both the 'protocol and physical' models. Note 
that both these models for the physical layer do not allow for 
rate and power adaptation. For simplicity this review assumes 
the protocol model. 

D. Network throughput 

With these assumptions it is proven in [1] that the capacity 
of random ad-hoc networks 

r(n) is 8( 



Thus the throughput per node decreases with increasing node 
density. The essential reason for this capacity decrease is the 
requirement for all nodes to share the wireless channel locally. 
This may be demonstrated by a contrast between the MAC and 
routing requirements. The mean source-destination distance is 
assumed to be L. The mean number of hops taken by packets 



T 



log n 



(3) 



is j^!^- The total traffic generated by all nodes due to the 
multi-hop relaying (routing) is n r{n) j:^^ bits/s. This traffic 
is required to be served by all nodes. However, the capacity of 
each node is reduced by interference (MAC), since nodes close 
to a receiver cannot transmit simultaneously. The interference 
radius is proportional to the transmission radius Q. Since the 
number of interfering nodes is proportional to the interference 
area (uniform distribution), the capacity loss is quadratic in 
the transmission radius tr{n). Thus, the available capacity 
reduces from riT to . T yi . The tradeoff between the routing 
requirement and the MAC restriction yields that the capacity 
of the network 

A lower limit on tr, due to the requirement of network 
connectivity, has been derived as tr{n) > \J^^- The lower 
limit on tr{n) ensures that no node in the network is isolated 
w.h.p.. This concept of connectivity in a network arises due 
to the assumed physical layer model, that requires a fixed rate 
of transmission and hence a minimum distance to allow for a 
link. The lack of isolation is required, since the performance 
metric is one of uniform throughput capacity, which would be 
zero if there was even a single starved node. The application 
of this limit to Q results in the capacity upper bound as 

r{n) < fc; ^ . (5) 

E. Lower bound 

To provide a capacity lower bound (feasibility), specific 
MAC and routing schemes are chosen. These schemes achieve 
the same order of capacity as the upper bound. The lower and 
upper bound prove the order bound (|3} on r(n). 

To elaborate, it is required to specify schemes for both 
the MAC and the routing on this random network. Such a 
specification requires imposing some structure on the random 
network. Motivated by cellular architectures, a tessellation 
(covering by 'cells') of the unit area is considered. Regularity 
in the tessellation (in the properties of every cell) necessitates 
a regular cell shape. However, since the network is random, 
some deviation from a regular cell shape should be allowed to 
ensure that the tessellation may be made as fine as required. 
A Voronoi tessellation Vn- of the surface S'^ is a tessellation 
that has the desired properties. The tessellation (such as in 
Figure [3 has the following properties : 

1) Every Voronoi cell V contains a disk of area ^°"^°s" 
and corresponding radius p{n). 

2) Every Voronoi cell is fully contained within a circle of 
radius 2p{n). 

The existence of such a tessellation was proved in [1]. The size 
of each cell, relative to the number of nodes, is important. It 
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is required that every cell contain at-least one node, to ensure 
that the routing scheme (described below) is feasible. It is this 
uniformity restriction that results in the choice of cell size. 
The resulting 'cellular- like' architecture imparts some notion 
of regularity on the random network. 

MAC and available capacity: With the Voronoi tessella- 
tion, a MAC is defined that achieves a scheduling between 
the cells. The MAC ensures that transmissions from a cell do 
not interfere with transmissions in simultaneously transmitting 
cells. The radius of transmission is chosen to be tr{n) — 8p{n) 
to allow for direct transmissions between adjacent cells (cells 
sharing a point) and within a cell (Figure [3. Cells containing 
nodes within a distance of (2 + A)tr{n) are interfering cells, 
since a node in one cell may interfere with the transmission 
in the interfering cell (|2j. The distance between two nodes 
in interfering cells is upper bounded. Also, the area of each 
interfering cell is lower bounded, by the tessellation properties. 
The ratio of the maximum interference area and the minimum 
area of each cell is a constant, k^. Thus for every cell in 
the tessellation the total number of interfering cells may be 
upper bounded by a constant which depends only on A 
(the parameter of the interference model). Consequently the 
graph defined by interference amongst cells, has a bounded 
degree of k^. 

The chosen MAC is a schedule of 1 + fcs slots, in which 
each cell is assigned one slot to transmit. This is possible 
since a graph with degree not greater than k^ may be colored 
by 1 + fca colors [29]. Thus, the 'cellular-like' architecture is 
utilized to achieve a simple slotted MAC amongst the cells. 

Therefore each cell has an available capacity of 



available capacity = 



bits/s . 



(6) 



Routing: every source destination pair may be connected by 
a straight line segment (segment of a great circle on S^), as in 
Figure n The packet routing scheme employed is as follows. 
Packets originating from a source are relayed from the cell 
containing the source to the cell containing the destination in 
a sequence of hops. In each hop, the packet is transferred from 
one cell to another, in the order in which cells intersect the 
straight line segment connecting the source and destination. 
A node (head/relay node) is chosen randomly in each cell 
to relay all traffic. Within a cell all sources send traffic to 
the head node and destinations receive traffic from the head 
node. This choice of routing is independent of the MAC and 
hence the two are analyzed separately. This separation of the 
MAC and routing may not necessarily be the optimal strategy 
(from a maximizing throughput perspective), however here it 
is required to show some scheme that achieves the same order 
as the upper bound. 

To make relaying of traffic between cells feasible, it is 
required that every cell in the tessellation V„ contains at 
least one node w.h.p. (uniformity requirement). A simple 
union bound of the probabilities that every cell contains at 
least one node is insufficient, and hence, a more intricate 
technique is required to provide this uniformity. The appendix 
reviews Vapnik-Chervonenkis (VC) theory, which provides the 
required uniform convergence in the analysis of the uniform 



throughput capacity. 

Routing and traffic to be carried: The VC theorem ( l33t 
obtained from VC theory may be applied to the tessellation 
of the network. A set of disks (of fixed area) have a VC 
dimension of three. Each cell in the tessellation contains a disk 
of radius p{n) and is contained in a disk of radius 2 p{n) . The 
application of the VC theorem to the set of disks (contained 
in the cells) yields 

Probf sup |iV(y) - lOOlognl < 501ogn 

>1-^^, (7) 
n 

where N{V) is the number of nodes in cell V. The result (ap- 
plied to both the sets of disks) implies that w.h.p., the network 
G is such that for every Voronoi cell in the tessellation, the 
number of nodes per cell obeys 50 logn < N{V) < 150 logn. 
This result allows for the viability of the routing scheme, by 
guaranteeing a node in every cell that can serve as the head 
node, and justifies the choice of cell size as i22i2gli 

The traffic generated due to this routing scheme is consid- 
ered. The random sequence of straight-line segments is i.i.d 
and hence the weak law of large numbers may be applied to 
the routes which approximate these line-segments. The traffic 
to be carried by a cell is proportional to the number of straight 
line segments passing through the cell. The number of routes 
intersecting every cell maybe bounded w.h.p.. Thus, the traffic 
to be carried by every cell can be upper bounded w.h.p. as 



sup (Traffic carried by cell V) 



< k4 r(n)y^ nlogn. 



(8) 



Bounds: The lower bound is derived by constraining the 
traffic to be carried (|8j, obtained from the routing require- 
ments, to be less than the available capacity (|6}, obtained from 
the MAC constraint. Thus for random networks 



r(n) — fcs 



T 



(9) 



y/n log n 

bits/s is feasible w.h.p.. The upper bound, obtained by the 
requirement for connectivity, also presented the same order 
dl} and hence r(n) is of order 6( , ). 

III. UWB NETWORKS 

Next, the capacity of UWB random ad-hoc networks is 
considered. The assumed random network model as described 
in Section UH is the same as in [1]. In contrast to existing 
literature, the following UWB Communication Model is 
assumed to model the physical layer: 

1 ) Power: Each node is constrained to a maximum transmit 
power of Po- 

2) Bandwidth : The underlying communication system has 
an arbitrarily large bandwidth (W oo). 

The key characteristic of such a model is the low spectral 
efficiency (i.e., jf^ ^ 1), which implies a relatively large 
bandwidth [28] or a relatively tight power constraint. The 
results of this paper are applicable to all systems that have 
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a low spectral efficiency. Thus, in particular, the results hold 
for two practical applications, 

1) UWB systems, where the bandwidth used is of the order 
of a few GHz, such as in the IEEE 802.15.3a standard 
[23]. For such a system, W oo, which implies that 
■ , ^ 1. As mentioned in the Introduction, such 

iVo w 

systems are actively being considered for commercial 
deployment. 

2) Sensor networks, with bandwidths of the order of a few 
MHz or less, but which use very low power devices (to 
extend battery life). For such a low power system, Pq 

0, which implies that <C 1. Such networks are 

being considered in both military as well as commercial 
applications [30]. 

As noted in Section |l] an ambient Gaussian noise power 
spectral density of A'o and a signal power loss of l/d", with 
distance d, is assumed. Here a > 1, is the distance loss 
exponent. Shadowing effects are not considered in this model. 
Capacity-achieving Gaussian channel codes are assumed for 
each link. Thus, each link is assumed to support a data rate 
corresponding to the Shannon capacity^ of that link [31]. It is 
assumed that each node can transmit and receive simultane- 
ously (although this restriction does not affect the results, as 
shown later). Also, each node can control its transmit power, as 
well as adapt its data rate to the link condition [32], [33]. Every 
node may transmit or receive, and wishes to communicate with 
a randomly chosen destination (chosen as the node closest to 
a randomly chosen point). 

It has been shown in [1] that r{n), the uniform throughput 
capacity per node, is a decreasing function of n (|3}, under 
a simplistic fixed per-link data rate. In the communication 
model assumed here, the constraints on power and bandwidth 
are different from [1], the link capacity explicitly depends on 
distance, and each link is allowed to adapt its power and rate. 
Therefore, the results of [1] are not applicable in our case. 
Thus, in contrast, in this paper it is shown that under the new 
communication (physical layer) model, the capacity of the ad- 
hoc network increases as a function of the node density nl 

To demonstrate this result, the characteristics of the com- 
munication model are studied. This includes a presentation 
of the optimal MAC, followed by an analysis of the routing 
problem that provides the required upper and lower bounds on 
the uniform throughput capacity. 

IV. Optimality of CDMA MAC 

The interference problem in the ad hoc network is first 
addressed. It is shown that the interference perceived by a 
receiver is bounded w.h.p., and hence, a certain scaling of 
bandwidth W, as a function of n, renders the interference 
negligible. This, implies that under the limiting bandwidth 
assumption, a 'CDMA MAC scheme' is optimal, i.e., all 
transmitters transmit at the same time, using the entire band- 
width. Here 'optimal' is used in comparison to time/frequency 
scheduling schemes as noted subsequently. 

^The Shannon capacity r for a link with Gaussian noise and interference 
sources is, r = VFlog(l + SINK), where SINR is the signal-to- 
interference noise ratio of that link. 



A. Bandwidth Scaling 

Let Xi denote the node and its position. Let > be the 
transmit power chosen by node Xi to transmit to its chosen 
receiver Xj, over link Xi Xj. The node power constraint 
Pq implies that P,; = J2j Pij ^ Po- The wireless medium 

where 



causes a power loss g,;j, given by gij — \x -x-\° 
other physical constants like antenna gain have been absorbed 
into Nq. The distance \Xi — Xj\ is defined as the length of 
the segment along the great circle, connecting Xi and Xj, on 
the surface S^. The signal-to-interference noise ratio at the 
receiver Xj is [31] 



SINR 



ij9ij 



(10) 



where / is the set of all interfering nodes (the set of 
all simultaneous transmitters). It is required to bound the 
interference, so that a certain bandwidth scaling can render 
the interference negligible with respect to ambient noise. The 
problem stems from the fact that potentially, a node arbitrarily 
close to the receiver (i.e., Xk s.t. \Xk — Xj\ 0) could 
cause arbitrarily large interference. This is, however, a very 
low probability event. Specifically, let the random variable 
dmin{G) denote the minimum distance (on the surface S"^ of 
the sphere ) between pairs of nodes in a specific realization G 
of the the network. The following lemma shows that dmin (G) 
cannot be very small. 



Lemma 1: Prob 
Proof: 



— log n 



Prob dmin{G) < 



n^/logn 



= Probf \J{\X,-X,\ < 

y ^j nVlogn 

< EP™bf|X.-X,|<--l 



t>3 



(a) 



'n? log n ' 



(11) 



where (a) arises because the uniformly distributed node Xj 
has to lie within a disc of radius — ,} centered on Xi. 

1 _ 



Thus w.h.p., d„ 



log ri 

i(G) of network G exceeds 



/log n ' 

Noting that Pk < Pq, \I\ < n and gij < (n'^logn)^ 
(from ^^), the total interference can be bounded w.h.p. by 
Pon{n^logn)^ . Thus, setting W — Q{n{n^logn)^ ) renders 
the interference negligible with respect to ambient noise. ( 
Section fvTll discusses a practical bandwidth scaling.) 

The above bandwidth scaling ensures that there is no re- 
quirement to schedule transmitters, since they cause negligible 
interference to each other. This bandwidth scaling (which 
implies that — > oo, as ^ oo), allows for a CDMA 
MAC, where all nodes may transmit simultaneously. It is 
proved below that the CDMA MAC is indeed an optimal MAC 
scheme for such an ad hoc network. 
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B. Optimality of CDMA MAC 

The optimality of the CDMA MAC is in the sense that 
it performs at least as well as any other optimal scheduling 
scheme, which assigns time slots and frequency bands to 
various nodes (TDMA/FDMA), as shown below. 

Since the bandwidth is arbitrarily large, each link's Shannon 
capacity is proportional to the received power, as below. 



r-rj = lim M^log(l 

W^oo 



Pijdij \ _ Pij9ij 



No 



(12) 



Throughout this paper, log(-) denotes logg( ) and capacity is 
expressed in units of nats [31]. As il2\ shows, although the 
bandwidth is infinite, the link capacity is bounded, due to the 
finite power constraint, a classical result in communication 
theory. 

Now, assume that there exists a TDMA/FDMA scheduling 
scheme, which coupled with some routing scheme, achieves 
the maximum possible uniform throughput. Consider the fol- 
lowing generic partition of the allotted bandwidth W and 
the time frame (normalized to unity), corresponding to this 
optimal TDMA/FDMA solution; {Wk,k = 1,2,..., K}, s.t. 
Ek Wk = W, and {ft, t = 1, 2, . . . , T}, s.t. ft - 1. The 
assumed optimal TDMA/FDMA scheduling scheme partitions 

k t 

the total power assigned to link Xt Xj, as P^f s.t. 
S/c t ftPif = Pij- Thus, Pf-'* is the power assigned to the 
hnk in the t*'' time slot of length ft, and over the fc*'^ frequency 
band of bandwidth Wk- The following theorem shows that a 
CDMA MAC is indeed optimal. 

Theorem 2: For each link Xi — > Xj, the rate achieved 
using a CDMA MAC scheme is not less than that achieved 
using the ptimal TDMA/FDMA scheduling scheme. 

Proof: Consider a particular link Xi Xj. Since 
the rate achieved is upper bounded by the capacity in the 
absence of interference, the rate achieved on this link by the 
TDMA/FDMA scheme is bounded as. 



K T 

TDMA/FDMA , 



^ 1^1^ ftWk log(l 
fe=i t=i 



NoWk 



(a) 
< 



K T 



k=l t=l 

PiA Qii 



Plj 9ij 

No 



No 



(13) 



where (a) arises since a;log(l + ^) < C. Pij < Po is the 
total power assigned to link Xi Xj. Since ^^^^'^ is the 
rate achieved by the CDMA MAC scheme Sl2\ . the theorem 
is proved. ■ 
It is thus proven that the arbitrarily large bandwidth renders 
interference to be negligible and the resulting CDMA MAC 
is optimal. It should be noted that making an ad-hoc wireless 
network interference free, alone, is insufficient to drastically 
improve its capacity. Utilization of highly complex signal 
processing to generate arbitrarily narrow directed beams at 
transmitters and receivers results in the removal of all inter- 
ference. However, even such an impractical technique was 
shown [34] to provide a capacity gain (compared to [1]) 



of only 8(log^(n)). As is demonstrated below the assumed 
UWB physical model provides a combination of mechanisms, 
including rendering interference negligible, that results in a 
drastically improved capacity. 

The (optimal) CDMA MAC scheme is assumed in the sub- 
sequent sections. Essentially, this results in a clean separation 
of the MAC and routing problems, i.e., it remains to consider 
optimal routing of the source-destination pairs, with the links 
scheduled using the CDMA MAC. Since the bandwidth is 
arbitrarily large, the key constraint is no longer bandwidth, but 
rather the power of the nodes. Thus, unlike [1], which analyzed 
the distribution of bandwidth among the different links, in our 
case, the distribution of power among the competing links and 
routes needs to be analyzed. 

V. An Upper Bound on throughput capacity 

With a CDMA MAC, the optimal routing consists of finding 
source-destination routes for all sources, that achieve the 
uniform throughput capacity. The difficulty here is that, the 
per-node power constraint results in a coupling between the 
route selections for different sources. However, interestingly, 
as opposed to the classical routing problem in wired networks 
[35], the constraint is not in terms of the capacities of indi- 
vidual links, but rather, in terms of the total power transmitted 
by each node. An upper bound on throughput capacity is 
derived in this section, by analyzing such a 'power-constrained 
routing' problem. 

The upper bound (Section|llJ in [1] was derived by bounding 

the minimum transmission radius as tr(n) > \/^^^^. 
It was shown that violation of the bound on tr{n) would 
result in an isolated node (all neighbors being beyond tr{n)), 
causing the uniform throughput capacity to be zero. However, 
in this case, due to link adaptation il2\ . there is no concept 
of node isolation, or network disconnectivity, since the link 
capacity simply decreases with distance, but is always non- 
zero. Therefore, a more sophisticated method, which can 
analyze the optimal power-constrained routing in detail, is 
required to upper bound the throughput capacity. 

A. Traffic Routing 

The routing problem is to find a set of routes for each 
source-destination pair, and to find the power to be allotted 
to each link along these routes, to maximize the uniform 
throughput capacity of the network. Under the optimal CDMA 
MAC, each link's Shannon capacity is proportional to the 
received power (I12t . Thus, 

_ Pij9ij 



No 



^ P^j ^n,No X,- X, 



(14) 



The coupling of the various routes (due to the per-node 
power constraint) complicates the routing analysis. Therefore, 
towards obtaining an upper bound, the power constraint is 
relaxed from a constraint on each node, to an average (or 
equivalently, total) power constraint. Thus, assume that 



P, < nPo, > Vz, 



(15) 
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instead of Pi < Pq \f i. 

Consider the source node Xi and the set of all possible 
routes from this source to its final destination (recollect that 
each source is assumed to have exactly one destination) . Note 
that all links on a specific route must operate at an equal data 
rate. For, if this were not the case, a redistribution of power 
amongst the links (while maintaining the total power utilized) 
would result in a new rate which is at least as large as the 
previous rate. (Such a redistribution is possible due to the 
relaxed power constraint dlSll.) Therefore, each route can be 
associated with a single rate. 

Thus, assume some optimal power distribution amongst 
the set of all routes for a given source-destination pair, for 
each pair, that results in the maximum uniform through- 
put. Can this power distribution be characterized? Consider 
two routes corresponding to a given source-destination pair, 

Xi Xf , as Ri = [X°XlXf Xf ] and R* = 

[X°Xl*Xf* Xf *] where Xf Xf* = X is the source 

and Xf^ — X^* is the destination. Let ri{n) and r*{n) be 
the rates achieved on these routes (i.e., on every link of each 
route) respectively. The route R* is defined as the route for 
which 

f]ixf -xf-i*r (16) 

k=l 

is the minimum of all possible routes from the source to its 
destination, i.e., R* is the minimum power route for the chosen 
source-destination pair From (I14> . the total power used on 
these routes is respectively 

k=K 

P{R.) =n{n)N,{Y, |Xf-X^T), 
fc=i 

k=K* 

P{R*) = r*(n)Xo( ^ \Xt ~ Xt'*C). (17) 

k=l 

If the power P{Ri) is shifted from the route Ri to R*, by 
scaling the power of each Hnk Xf * Xf+^* of R* by a 
factor (1 + p^j^t j ), the relaxed power constraint ( I15> would 
still be met, while the achieved rate on R* would be not 
less than ri{n) + r*{n). This follows from (O and from the 
definition il6i of R* . Thus, under the relaxed power constraint 
(I15> . it is sufficient for each source-destination pair i to choose 
the minimum power mute R* to route all its traffic, so as to 
maximize its rate. Further, different source-destination pairs 
make their choice independent of other pairs. Note that such 
a simplification in routing is not possible with the original 
per-node power constraint (Pi < Pq V i)- 

Therefore, the exact uniform throughput capacity r"(n) 
under the relaxed power constraint may be obtained by setting 
r*{n) = r"(r7,), V i, and solving = "-^o. where 

P{Ri) is given by Ml\ . This routing scheme will be referred 
to as 'Minimum Power Routing'. In general, this may not 
coincide with shortest-path routing. 

The uniform throughput capacity with the per-node power 
constraint r{n) satisfies r{n) < r"(n). The objective of this 
section is to upper bound r{n), which is achieved below by 
upper bounding r^{n). 



As an aside, the per-node power-constrained routing prob- 
lem may be posed as a convex optimization problem. The 
problem is similar to the classical joint optimal routing prob- 
lem for wired networks [35], but differs in that the constraints 
are per-node power constraints, rather than per-link capacity 
constraints. Decentralized algorithms to solve the classical 
joint optimal routing problem [35], and the resulting practical 
routing protocols, may point to similar solutions for our power- 
constrained routing problem. This will be the subject of future 
investigation. 

B. Maximum number of nodes on a route 

As described above, the 'Minimum Power Routing' scheme 
chooses the minimum power route R* (I16> for each source- 
destination pair i. To bound r"(n), the maximum number of 
hops in R* is required. Intuitively, if it were possible for 
K* to have a large number of short hops, then potentially 
the throughput capacity can become very large, due to rate 
adaption M2\ . 

Denote Di as the distance between the source Xi and its 
destination X/^* (measured on S^). By the triangle inequality, 
the sum of the hop-lengths Li of path R* is lower bounded 

by Di as, 

k=K* 

U= J2 - \>\X^- Xf* I = A (18) 

k=l 

Consider the Voronoi tessellation of the network, as described 
in Section HI] Note that p{n) is the radius of the circle with 
area " on the surface of a sphere S'^. Also note that 

, , , / 3200 log n 

4p n <W ^. (19) 

V im 

This is because a circle of radius p on S'^ has an area less 
than Trp^ and more than ^p^- The following lemma will be 
used to bound the number of nodes on R*. 

Lemma 3: The number of Voronoi cells Nmax that in- 
tersect a minimum power route R* is upper bounded by 
32 + -i^ii^. 

Proof: A particular node Xi is considered, along with its 
corresponding optimal route R* . Define a region C [R* ) C S*^ 
as follows. 

Y eC{R*) iff 3 Z e R* s.t. \Z -Y\ < 4/7(n), 

here Y and Z are points on S^. 

C{R* ) defines a coverage region around the route such that 
all cells intersecting the route have to be fully contained within 
this coverage region. We now bound the area of the coverage 
region. Corresponding to a route, each link contributes a band 
(rectangular region) of width Ap{n) and length \Xf* —X^^^*\ 
to the coverage region. Also, the edge links contribute an 
additional two semi-circular regions of radius Ap{n). This is 
demonstrated in Figure |2] Thus, the total area is bounded as 

. 32001ogn „ / 3200 log n ^ 

Area C P* )) < — + 2\ —Li (20) 

n V Trn 
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The minimum area of a Voronoi ceU is (Section 
Since the route can only intersect cells that are completely 
contained in C{R*), the number of such cells is upper bounded 
as 

Area(C(i?*)) 



N„ 



< 



< 32 



Minimum area of a Voronoi cell 

l6Lix/n 



10 Vtt Vlog n 



Using the result 0, with probability exceeding 1 — " , 
every cell in the tessellation contains at most 150 logn nodes. 
Thus, the maximum number of nodes on R* is bounded w.h.p. 
as 



N. 



nodes 



< (150 logn) X (N^ax) 

< (ci log n + C2 Li y/n logn) 



(21) 



C. Upper bound on throughput capacity 

The power P{R*) (seeO utilized on R* is related to the 
length of the route and the rate achieved on that route. This 
relation is obtained as 

P{R*) = r-{n)No ^ \Xt ~ X^'T 



k=l 

max 

> r{n) No 



{ci log n + C2 Li y/nAogn)"' ^ 



> r{n)Naf{U), 



(22) 



where (a) is because of the convexity of ?/" for a > 1, 
K* < N^°f^^'' and from (b) is from (EJ} and because 
r"(n) upper bounds r(n) . (c) is from defining f{Li) = 



(ci log n+C2Li^/n log n)" ^' 

As a final step in deriving the upper bound, the expected 
total power required by all the n routes, over the ensemble 
of graphs G, is bounded by the total available power nPo- 
Therefore, by symmetry, the expected power of each route 
P(i?*) is bounded by Pq as 

Po > EP{R*) 

> r(n)iVoE/(A) 

ib) 

> r{n)No Prob(A > e) E(/(A) | A > e) 

> r(n)iVo (1 - e') E(/(A) | A > e) 

> r(n)iVo ( 



2 ) 



ynlogn)"^ 

where (a) is from ( I22> . the fact that f{Li) is an increasing 
function and from (I18> . Inequality (b) comes from the con- 
ditional expectation. Nodes are distributed uniformly on 5^, 
and hence the probability that the distance between a source 
destination pair exceeds e, is lower bounded by (1 — e^), which 
results in (c). When Li exceeds a constant e, the C2Li^J nlogn 
term in the denominator of f{Li) dominates, resulting in (d). 



Recollect that Di is the physical distance on S"^ between 
the source i and destination. Since EZJj is a constant, J23t 
results in 

r{n) < C4PQ{n\ogn)^ (24) 



with probability exceeding 1 — 
for sufficiently large C5, 



50 



Ids 



Thus, w.h.p.. 



lim Prob 

n — »oo 



r(n) = C5(nlogn)' 



is feasible 



= 0. 

f .-.^ (25) 

This proves the upper bound, r(n) — O I (nlogn)"^ j. The 
upper bound was achieved by analyzing the UWB physical 
model. The large bandwidth was exploited to demonstrate 
the negligible effect of interference. The allowed power and 
rate adaptation were analyzed to develop an optimal power 
constrained routing strategy, resulting in an upper bound 
on the uniform throughput capacity. As noted earlier, it is 
the combination of these mechanisms that allows for the 
demonstrated bound, an increasing function of n . 

D. Area Scaling 

The area of the network has been normalized to unity in 
the analysis above. Thus, the node density increases linearly 
with n. Consider an alternate scenario where the area of the 
network A increases with n, as uAq. This could represent a 
situation such as smart homes, where the node density Aq 
could be a constant. This results in a scaling of all distances 
by \/nAo. The upper bound under this area scaling may be 
obtained by following the arguments of the previous sections. 

The probabilistic arguments for routing optimality and the 
number of Voronoi cells intersecting a route remain the same. 
These arguments are independent of the absolute distances. 
The distance scaling, however, affects the relationship between 
power and capacity. Following the arguments for the upper 
bound, we note the point of departure is that, under the new 
scaling, EL.; = Q{y/n), and so J24> must be modified to 



i?(n) < CgPr 



(logn)- 



(26) 



where R{n) is the uniform throughput capacity under the new 
area scaUng, with a corresponding modification in i25\ . 

VI. Lower bound on throughput capacity 

To provide a lower bound on the capacity, the techniques 
reviewed in Section HI] will be useful. The MAC scheme is 
again chosen as the CDMA MAC, since that was shown to be 
optimal in Section Hvl 

We need to demonstrate a feasible routing scheme to 
provide the lower bound. The routing scheme chosen is the 
same as in [1]. Thus, as reviewed in Section |ll| a route is 
selected for each source-destination pair by following the 
minimum distance path (segments of great circles), as closely 
as possible. For such a routing scheme, the number of routes 
intersecting any cell maybe bounded w.h.p., similar to [1]. 
Thus, the traffic to be carried by a cell may be upper bounded 
w.h.p. as (|8}, reproduced below for convenience. 
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supy^y^ (Traffic carried by cell V) 



(27) 



Traffic is relayed from cell to cell till it reaches the cell 
of the destination node. However, each relay node has a limit 
on its available capacity. This limit arises due to the power 
constraint of the node, unlike [1 ], where the capacity limit 
arose from the bandwidth constraint of the network. From the 
Voronoi tessellation, we know that every cell is contained in 
a disk of radius 2p, and so the length of each hop, to reach 
the next relay node, is at most 8p. Thus, from J14> . the relay 
node has a total capacity bounded as. 



> 



(28) 



No V logn 

The trade off between the traffic to be carried ( I27> . obtained 
from routing requirements, and the available capacity ( I28> . 
obtained by power constraint provides the lower bound. Thus, 
from i27\ and ( I28l l. a uniform throughput r(n) is feasible 
w.h.p. if 



ki^r{n)\/n \ogn < 



That is. 



Prob r(n) = cy- 



n 2 



iVo Vlogn 



is feasible = 1. 



(29) 



(30) 



(logn) 2 

This proves the lower bound, r{n) ~ Vl^ i^-^^^^y 

As a side note, it was assumed that each node can transmit 
and receive simultaneously. However, this restriction can be 
easily circumvented, by assuming that each link transmits 
over only half the bandwidth (chosen randomly). Then, the 
transmission by a node can be thought of as causing an 
erasure in its own received signal. Thus, as long as each 
link is encoded with a rate-i erasure correction code [31], 
the throughput will reduce by a factor of at most two, thus 
satisfying the same order bounds. 

Area Scaling : As in the case of the upper bound, we can 
derive the lower bound under the area scaling A ~ uAq, for 
which node density is constant. The available capacity (I28> is 
altered due to the dependence of gain dj (where dj is used 
to represent the gain under the new scaling) on the absolute 
distance measure. Accounting for the scaling, the uniform 
throughput is bounded as 

R{n) < ^ (31) 

Thus, for the case of constant node density, both the upper 
bound and lower bound are decreasing with n. The intuitive 
reason for this is the explicit capacity-distance relationship 
(I14> . where capacity decreases with distance. 

Vll. Practical Bandwidth Scaling 

The large bandwidth W — O(n{n^logn)^), assumed in 
Section |IV1 to prove the optimality of CDMA MAC is re- 
strictive. However, practical bandwidth scaling schemes can 
be developed for the unit area and area scaling cases, that 



require smaller bandwidth. It only needs to be shown that the 
lower bound is achievable with a smaller bandwidth. 



Unit Area case: Let the bandwidth scaling he W — 
Wn. Allot each node a unique disjoint frequency band of 
bandwidth W, disjoint with the bands of other nodes (i.e., 
a FDMA MAC). Thus, the capacity of hnk Xi Xj is 



l^log(l 



NqW 



). A bandwidth of W = 0{n 2) is sufficient 



to ensure that the capacity is approximately linear in the 
received power. Therefore, the capacity under this FDMA 
MAC approximates the CDMA MAC link capacity iTH with 
a bandwidth W — il.{n^s~). 



Area scaling case: When the area is scaled as ^ = nAo, 
an efficient bandwidth scaling may be obtained, by choosing 
a hybrid FDM A/CDMA MAC. Recollect that the area of each 
Voronoi cell is 100 log n in this case, due to area scaling. Form 
a graph G', with nodes representing vertices, such that two 
vertices are connected if the corresponding nodes are within a 
distance of cn^/Togn of each other, for some large constant 
cii. Then, the number of cells that have a node connected to a 
given node is a constant. Each cell has at most 150 logn nodes. 
So, w.h.p., the degree of G' is upper bounded as ci2logn — 
1. Now, consider an FDM A/CDMA scheme where the total 
available bandwidth W = ci2WQ\ogn is partitioned equally 
into C12 logn disjoint frequency bands (FDMA). One band of 
width Wq is allotted to each node, such that no two nodes 
that are connected in G' are allotted the same band. A simple 
greedy algorithm can achieve such a graph coloring [29]. Thus, 
the MAC chosen is FDMA locally, while CDMA is used to 
handle the interference from outside the local region. It needs 
to be shown that Wq can be chosen so that the interference 
from nodes using the same frequency band (all of which lie 
outside the local region) is rendered negligible. 



To this end, consider the interference caused to a given 
(receiver) node, by nodes using the same frequency band. 
Consider the annulus regions formed by circles of radii p, — 
(cii — 10) iy/\og n, i > 1, centered on the receiver node 
under consideration. The number of Vorpnoi j^ells in each 
annulus can be upper bounded by C13 ''^^^^ . Even with 
the pessimistic assumption that every cell outside the circle 
of radius p\ has one node interfering with the center node 
(it cannot be more than one due to the local FDMA), the 
interference from eagh anpulus caused to the center cell is 
upper bound by C14 ^pi\^^^ ■ 



Since the total number of annuli is Ci^./r^^— the total 
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interference at the center node is therefore upper bounded as. 



Interference < C14 



i°s " 2 _ 2 
Pi+1 Pi 



pf log n 



C16 



< 



< 



(a) 
< 



C16 



2i + 1 



E 



— 1 




for any a > 2 



C17. 



(32) 

where (a) arises from the bound X]i'=i ^ ^ 1 + log(y)- 
Thus, a sufficiently large constant per-node bandwidth Wo 
is sufficient to render interference negligible with respect 
to noise iVgWo, if a > 2. Since there are 6(logn) fre- 
quency bands required, the total required system bandwidth is 
W = 0(log ri). Intuitively, with area scaling, the closest and 
(therefore) dominant interferers are moved away, resulting in a 
smaller interference. However, as demonstrated in Section fVll 
this area scaling results in a decreasing capacity function. A 
practical bandwidth scaling for a < 2 does not seem obvious. 

VIII. Conclusion 

In this paper, the capacity of an UWB (large bandwidth) ad- 
hoc wireless network with a power constraint was studied. Ex- 
amples of such a network include UWB and sensor networks. 
It was shown that for such a network, consisting of n randomly 
distributed identical nodes over a unit area, with probability 
approaching one (as n 00), the uniform throughput capacity 
r{n) is O {(n\ogn)~^ (upper bound) and ^( (i^g'°)(o+i)/2 ) 
(lower bound). Thus, the throughput capacity r{n) for such 
a random ad-hoc network is 8(n'^"^^)/^). Interestingly, this 
bound demonstrates an increasing per-node throughput, in 
comparison to the decreasing per-node throughput shown in 
[1]. The key reason for this dramatically contrasting result 
is that the assumed UWB physical model is characterized by 
finite power, large bandwidth, and the explicit use of link adap- 
tation. These characteristics combine to alleviate interference 
and exploit node density effectively while delivering traffic. 
Thus, the properties of the physical layer dramatically alter 
the ad hoc network capacity. 

Practical bandwidth scaling results were derived to show 
that the assumption of arbitrarily large bandwidth is not 
excessively restrictive. Further, the optimal MAC and routing, 
which can achieve network capacity, were specified - namely 
a CDMA MAC and a power-constrained routing. The demon- 
strated results and protocols may be utilized as a guideline for 
practical UWB network design. 

IX. Appendix 

The ideas of Vapnik-Chervonenkis (VC) theory are reviewed 
in this section. A finite set of points X (such as nodes) of size 



n and a set of subsets T (such as Voronoi cells) is considered. 
X is said to be shattered by T if for every subset B of X there 
is a set FeT such that X f]F — B. T generates all subsets 
of X. The VC-dimension of T is defined as the supremum 
of the sizes of all finite sets that can be shattered by T [36], 
[37]. 

An underlying probability distribution {PD) is assumed. 
An i.i.d sequence X = Xi...Xn is chosen with distribu- 
tion PD. The relative frequencies of events are N{F) = 
i Y^^=i I{XieF). Sets of finite VC-dimension obey a uniform 
convergence in the weak law of large numbers, i.e., relative 
frequencies of the events converge to their probabilities uni- 
formly. Formally, 

VC THEOREM: If is a set of finite VC-dimension, and 
Xi is a sequence of i.i.d. random variables with common 
probability distribution PD, then for every e, 5 > 0, 

Vxob{\N{F)- PD{F)\<e\lF eT ) > 1-5, (33) 

for sufficiently large n. The VC-theorem ( I33> is a stronger 
statement than the weak law of large numbers due to the 
uniformity over all events. 

The proof [37] is developed by first defining the growth 
function m{n), which is the maximum number of subsets of 
an n sized sample X generated by the set of events J^. It is 
then proved, for sets with finite VC-dimension vc, that the 
growth function is upper bounded as 



m{n) < rC"^ + 1 s.t. n > vc. 



(34) 



The probability that the relative frequency of a particular event 
F exceeds its expected value by e is exponentially decreasing 
in the sample size n. To bound the probability over the class of 
events T a union bound is applied. However the union is now 
taken over m{n) events since only m{n) events are distinct. 
Thus 



(35) 



Prob sup \N{F) - PD{F)\ > e 
< ra{n) * e~"^ 



The finite VC-dimension implies m(n) grows slower that the 
exponential in n, and hence the weak law holds uniformly. 
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